iTAG: Automatically Annotating Textual Resources with Human Intentions
نویسنده
چکیده
Annotations represent an increasingly popular means for organizing, categorizing and finding resources on the “social” web. Yet, only a small portion of the total resources available on the web are annotated. Work on automatic tag generation algorithms aims to tackle this problem by developing algorithms that attempt to approximate and support human tagging behavior. While existing algorithms largely focus on automatically describing the general topics covered by a resource (such as “career”, “education”), we suggest focusing on a different tagging dimension: i.e. automatically annotating resources with human intentions. Intent annotations aim to describe which goals are referenced in given textual resources (such as “find a job”, “get a degree”), thereby offering a new, interesting perspective on textual resources on the web. We describe a prototype – iTAG – for automatically annotating textual resources with human intent, and investigate the extent to which the automatic analysis of human intentions in textual resources is feasible. For evaluation purposes, we present results from an exploratory study that focused on annotating intent in transcripts of political speeches given by US presidential candidates in 2008.
منابع مشابه
Automatic Annotation of Bioinformatics Workflows with Biomedical Ontologies
Legacy scientific workflows, and the services within them, often present scarce and unstructured (i.e. textual) descriptions. This makes it difficult to find, share and reuse them, thus dramatically reducing their value to the community. This paper presents an approach to annotating workflows and their subcomponents with ontology terms, in an attempt to describe these artifacts in a structured ...
متن کاملAnnotating Genes Using Textual Patterns
Annotating genes with Gene Ontology (GO) terms is crucial for biologists to characterize the traits of genes in a standardized way. However, manual curation of textual data, the most reliable form of gene annotation by GO terms, requires significant amounts of human effort, is very costly, and cannot catch up with the rate of increase in biomedical publications. In this paper, we present GEANN,...
متن کاملA Review on Corpus Annotation for Arabic Sentiment Analysis
Mining publicly available data for meaning and value is an important research direction within social media analysis. To automatically analyze collected textual data, a manual effort is needed for a successful machine learning algorithm to effectively classify text. This pertains to annotating the text adding labels to each data entry. Arabic is one of the languages that are growing rapidly in ...
متن کاملTowards Automatic Image Region Annotation - Image Region Textual Coreference Resolution
Detailed image annotation necessary for reliable image retrieval involves not only annotating the image as a single artifact, but also annotating specific objects or regions within the image. Such detailed annotation is a costly endeavor and the available annotated image data are quite limited. This paper explores the feasibility of using image captions from scientific journals for the purpose ...
متن کامل"Ask Not What Textual Entailment Can Do for You..."
We challenge the NLP community to participate in a large-scale, distributed effort to design and build resources for developing and evaluating solutions to new and existing NLP tasks in the context of Recognizing Textual Entailment. We argue that the single global label with which RTE examples are annotated is insufficient to effectively evaluate RTE system performance; to promote research on s...
متن کامل